Maintaining Spatial Data Sets in Distributed-Memory Machines

نویسندگان

  • Susanne E. Hambrusch
  • Ashfaq A. Khokhar
چکیده

We propose a distributed data structure for maintaining spatial data sets on message-passing, distributed memory machines. The data structure is based on orthogonal bisection trees and it captures relevant characteristics of parallel machines. The operations we consider include insertion, deletion, and range queries. We introduce parameters to control how much imbalance is tolerated at each processor and to specify the load to be achieved during balancing. When balancing, we first broadcast point counts of a datadependent partition of the data. Based on this partition, we propose load balancing methods with different communication and computation requirements. We present initial experimental results for the Cray T3D.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Entropy-based Consensus for Distributed Data Clustering

The increasingly larger scale of available data and the more restrictive concerns on their privacy are some of the challenging aspects of data mining today. In this paper, Entropy-based Consensus on Cluster Centers (EC3) is introduced for clustering in distributed systems with a consideration for confidentiality of data; i.e. it is the negotiations among local cluster centers that are used in t...

متن کامل

Distributed and Memory Efficient Machine Learning for Spatial Analysis Applications

In the context of spatial analysis, spatial disaggregation or spatial downscaling are processes by which information at a coarse spatial scale is translated to finer scales, while maintaining consistency with the original dataset. Fine grained descriptions of geographical information is a key resource in fields such as social-economic studies, urban and regional planning, transport planning, or...

متن کامل

Compiler Techniques for Determining Data Distribution and Generating Communication Sets on Distributed-Memory Machines

This paper is concerned with designing eficient algorithms for determining data distribution and generating communication sets on distributed memory multicomputers. First, we propose a dynamic programming algorithm to automaliically determine data distribution at compiling time. The proposed algorithm also can determine whether da-La redistribution is necessary between two consecutive DO-loop p...

متن کامل

Munin: Distributed Shared Memory Based on Type{speciic Memory Coherence

We are developing Munin y , a system that allows programs written for shared memory multiprocessors to be executed eeciently on distributed memory machines. Thus, Munin overcomes the architectural limitations of shared memory machines, while maintaining their advantages in terms of ease of programming. A unique characteristic of Munin is the mechanism by which the shared memory programming mode...

متن کامل

Compiling Array Statements for E cient Execution onDistributed - Memory Machines : Two - level

In languages such as High Performance Fortran (HPF), array statements are used for expressing data parallelism. In compiling array statements for distributed-memory machines, eecient enumeration of local index sets and communication sets is important. The virtual processor approach, among several other methods, has been proposed for eecient enumeration of these index sets. In this paper, using ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997